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MAPS ON POSITIVE DEFINITE MATRICES PRESERVING BREGMAN 
AND JENSEN DIVERGENCES 

LAJOS MOLNAR, JOZSEF PITRIK, AND DANIEL VIROSZTEK 


Abstract. In this paper we determine those bijective maps of the set 
of all positive definite n x n complex matrices which preserve a given 
Bregman divergence corresponding to a differentiable convex func¬ 
tion that satisfies certain conditions. We cover the cases of the most 
important Bregman divergences and present the precise structure of 
the mentioned transformations. Similar results concerning Jensen di¬ 
vergences and their preservers are also given. 


1. Introduction 

In a series of papers EM, 121 ]9) the first author and his coauthors de¬ 
scribed the structures of surjective maps of the positive definite cones in 
matrix algebras, or in operators algebras which can be considered gener¬ 
alized isometries meaning that they are transformations which preserve 
"distances" with respect to given so-called generalized distance mea¬ 
sures. This latter notion stands for any function d : SC x SC —>■ [0,oo) on 
any set SC with the mere property that for x, y e SC we have d{x, y) = 0 if 
and only if x - y. We recall that in several areas of mathematics not only 
metrics are used to measure nearness of points but also more general 
functions of this latter kind. 

In [ 12l|9| the considered generalized distance measures are of the form 
d = d,M'g, where N(.) is a unitarily invariant norm on the underlying ma¬ 
trix algebra or operator algebra, g : (0,oo) -*■ C is a continuous function 
with the properties 

(al) g(y) = 0 if and only if y = 1; 

(a2) there exists a constant K > 1 such that | g (y 2 ) \ > K | g(y) |, y > 0, 
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and the generalized distance measure d.N,g is defined by 


(1) d N , g (X,Y) = N[g[Y- ll2 XY~ 1 ' 2 )) 


for all positive invertible elements X, Y of the underlying algebra. In the 
mentioned papers one can see several important examples of that sort 
of generalized distance measures, many of them having backgrounds in 
the differential geometry of positive definite matrices or operators. The 
basic tools in describing the structure of the corresponding generalized 
isometries have been so-called generalized Mazur-Ulam type theorems 
and descriptions of certain algebraic isomorphisms (Jordan triple iso¬ 
morphisms) of the positive definite cones in question. 

In the present paper we determine the structures of generalized isome¬ 
tries with respect to other important types of generalized distance mea¬ 
sures. Namely, here we consider Bregman divergences and Jensen di¬ 
vergences. These types of divergences have wide ranging applications 
in several areas of mathematics. For example, in the recent volume [GO 
on matrix information geometry 3 chapters are devoted to the study of 
Bregman divergences. One feature of Jensen divergences which justifies 
their importance is that Bregman divergences can be considered as as¬ 
ymptotic Jensen divergences (see Section 6.2 in [13]). We further mention 
that the famous Stein’s loss and Umegaki’s relative entropy are among the 
most important Bregman divergences. Our basic tool in this paper to de¬ 
termine the corresponding preserver transformations is, just as above, 
also algebraic in nature but rather different from what we have men¬ 
tioned in the previous paragraph. Namely, here we use order isomor¬ 
phisms. 

Before presenting the results we fix the notation and terminology. In 
what follows y„ denotes the algebra of all n x n complex matrices and 
stands for the positive definite cone in M„, i.e., the set of all positive 
definite matrices in M n . Whenever convenient, in the paper we use the 
equivalence of the language of matrices and that of linear operators on 
C n . 

We next define the two basic concepts what we consider in this pa¬ 
per, Bregman divergence and Jensen divergence. Both concepts are con¬ 
nected to convex real functions. Let / be a convex function on the in¬ 
terval (0,oo). It is known that / is necessarily continuous and the set of 
points where / is differentiable has at most countable complement. It is 
a remarkable fact that if / is everywhere differentiable then its derivative 
f is automatically continuous (see e.g. [151 Corollary 25.5.1]). 
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For a differentiable convex function / on (0,oo), the Bregman /- 
divergence on P„ is 

H f {X, Y) = Tr (/(X) - /(7) - /'(7)(X - X)), X, 7 e P n , 

see e.g. formula (5) in |T4) . If lim x ^ 0 + fix) and lim x ^ 0 + fix) exist, 
then /, /' have continuous extensions onto [0, oo) and the Bregman /- 
divergence is well-defined and finite for any pair of positive semidefinite 
matrices, too. 

For a convex function / on (0,oo) and for given A e (0,1), the Jensen 
A - /-divergence on P„ is defined by 

Jf,\{X, Y) = Tr (A/(X) + (1 - A)/(7) - / (AX + (1 - A) 7)), X, 7 e P n . 

If lim x ^ 0 + /(x) exists, then the Jensen A - /-divergence is also well- 
defined and finite for any pair of positive semidefinite matrices. 

It is well-known that Hf and Jj\\ are always nonnegative and they are 
generalized distance measures (in the sense we use in this paper) if and 
only if / is strictly convex. 

Our main aim is to describe the "symmetries" of the positive definite 
cone P n that preserve above type of divergences. This means that we are 
looking for the structure of all bijective maps <p: -* P n for which 

H / (0(X),d>m) = H / (X,F), X,Y eP„ 

or 

h,f (0(X),d>(7)) = / A>/ (X, Y), X,YcP n 

holds. 


2. The results 

We now turn to our results. It is useful to examine first the question that 
how different the present problem is from the ones we have considered in 
the papers (3][8, I2J19]. To see clearly the differences we determine below 
which Bregman divergences, respectively which Jensen divergences are 
of the form CD. Let us begin with the case of Bregman divergences. 

One of the most important Bregman divergence is the one usually 
denoted by l which corresponds to the strictly convex function /(x) = 
- logx, x > 0 and is called Stein’s loss. Apparently, we have 

Z(X, Y) = -Tr(logX-log7- 7 _1 (X- 7)) = TrX7 _1 -logdetX7 _1 - n 

for all X, 7 e P n , where we have used the identity Tr o log = logo det on P„. 
On the other hand, the continuous function g(y) = y - logy - 1, y > 0 is 
nonnegative, satisfies the conditions (al), (a2) (with constant X = 2), and 
with the trace norm ||. || i we easily get 

4|| 1)g (X, 7) = Ig (7- 1/2 X7" 1/2 ) I, = Z(X, 7), X, 7 e V n . 
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In the next proposition we prove that Stein’s loss is essentially the only 
Bregman divergence on which is a generalized distance measure of 
the form CQ . Observe that any distance measure of the form £D is invari¬ 
ant under multiplication of the variables X, Y by the same positive scalar 
t. 

Proposition 1. Letf be a differentiable convex function on (0, oo). Assume 
that the Bregman f-divergence Hf on P n is homogeneous of degree 0, i.e., 
it satisfies 

(2) Hf{tX,tY) = H f {X,Y) X,YeP n ,t> 0. 

Then we have f{x) = alogx + bx + c, x > 0 with some real scalars a, b, c 
and a< 0. 

Proof We plug scalar multiples of the identity X = xI,Y = yI, x,y > 0 
into the equality © and obtain 

(3) /(fx)-/(fy)-/'(fy)f(x-y) = /(x)-/(y)-/'(y)(x-y), t> 0. 
Choosing t=l/y and reordering this equality we have 

fiy) = (l/(x- y)) ifix) - fiy) - f(x/y) + /(1) + f'i 1)((x/y) - 1))) 

implying that / is twice continuously differentiable. Differentiating © 
with respect to x twice we obtain f'{x) = f"(tx) t 2 , x,t> 0. In particular, 
it follows that fit) is a constant multiple of t~ 2 , t > 0. We infer that 
fit) = rzlog t+bt + c, t> 0 holds with some real constants a, b, c. By the 
convexity of / we have a < 0. □ 

Concerning the appearance of the function x —>■ bx + c in the above 
proposition we note that adding an affine function to a given convex 
function / does not change the corresponding Bregman divergence (the 
same holds for Jensen divergence, too), hence that part simply does not 
count. 

We next examine the case of Jensen divergences. Again, let fix) = 
-logx, x > 0 and pick A e (0,1). It is easy to see that the corresponding 
Jensen divergence is 

/_tog,A iX, Y) = logdet (AX + (1 - A) Y) - logdetX 1 y 1_A , X,Y e P n 
which can also be written as 

J-io g , A (X, Y) = logdet (AX“ 1/2 Xy~ 1/2 + (1 - A)/) - logdet [Y~ ll2 XY~ ll2 ) X 

= Tr (log((AX“ 1/2 Xy _1/2 + (1 - A)/) (F“ 1/2 X7“ 1/2 )“ A )), X, Y e P n . 

These divergences are scalar multiples of the Chebbi-Moakher log- 
determinant a-divergences, see (21112]. As mentioned in [12] (see pages 
146-147) the continuous function g\iy) = log((Ay + (1 - A))/y A ), y > 0 is 
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nonnegative, satisfies (al), (a2) (with constant K = 2) and, moreover, we 
have 

/_i og , A (X,y) = ||g(y- 1/2 XF- 1/2 )|| 1 , X,YeP n . 

This means that the Jensen divergences /-i 0 g,A are also of the form 0. 

Let us insert the remark here that in the particular case A = 1/2 the 
above Jensen divergence was considered and called S-divergence in the 
paper |T6| of Sra. He proved the interesting fact there that the square root 
of this divergence is a true metric and proposed to use it as a convenient 
substitute for the geodesic distance dy 2 > i og (II -II2 standing for the Hilbert- 
Schmidt or Frobenius norm) originating from the natural Riemann geo¬ 
metric structure onP„. 

Continuing the discussion, above we have seen that the divergences 
/-log,A are of the form 0. In what follows we show that they are essen¬ 
tially the unique Jensen divergences with this property. 

Proposition 2. Let f be a convex function on (0,oo) and pick a number 
A e (0,1). Assume that the corresponding Jensen A - f divergence is homo¬ 
geneous of order 0, i.e., it satisfies 

(4) J f}X {tX,tY) = J fiX [X,Y) X,YeP n ,t> 0. 

Then we have f{x) = alogx + bx + c, x > 0 with some real scalars a, b, c 
and a< 0. 

Proof Just as above, plugging scalar multiples of the identity X = xI,Y = 
yl, x, y > 0 into the equality 0 we have 

A fltx) + (1 - A )flty) - flXtx + (1 - A) ty) 

,JI = A/(x) + (l-A)/(y)-/(Ax+(l-A)y), t>0. 

We assert that / is differentiable. In fact, choosing x = 1 we have 
fit) = (1/A) (A/(l) + (1 - A)/(y) - /(A + (1 - A)y) 

-(1 - A)/(fy) + /(Af + (1 - A) ty)). 

The result 0 11.3. Theorem] of Jarai on the regularity of solutions of 
functional equations applies and tells us that / is necessarily continu¬ 
ously differentiable. Differentiating 0 with respect to t we have 

0 = A f'ltx)x+ (1 - X) f'[ty)y- flXtx + (1 - A) fy)(Ax + (1 — A)y) 

implying the equality 

Xf'ltx)x+ (1 - A )f'{ty)y = f'{Xtx+ (1 - A) fy)(Ax+ (1 - A)y) 
for all positive t, x, y. Choosing t- 1 we infer 

A/'(x)x+ (1 - A )fly)y = fl Ax+ (1 - A)y)(Ax+ (1 - A)y). 
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This means that the function h{x) = f'{x)x, x > 0 is affine and hence it is 
of the form h{x) = bx + a, x > 0. It follows that fix) = a\ogx + bx + c, x > 0 
with some real scalars a, b, c. Again, the convexity of / implies a < 0. □ 

The above results can be considered as characterizations (hope¬ 
fully a bit interesting) of the Stein’s loss and the Chebbi-Moakher log- 
determinant a-divergences. 

We now turn to the descriptions of the corresponding generalized 
isometries, i.e., transformations preserving the considered divergences. 
We begin with a general result relating to Bregman divergences. 

We first mention the easy fact that both kinds of divergences are clearly 
invariant under unitary and antiunitary congruence transformations. 
These are maps of the form A >— UAU*, where U is a unitary or an an¬ 
tiunitary operator on C”. In fact, this follows from the following. For any 
continuous function / on (0,oo) and for any unitary or antiunitary op¬ 
erator U on C n we have that f{UAU*) = Uf{A)U* holds for every pos¬ 
itive definite A. This is the consequence of the fact that on the spec¬ 
trum of A the function / coincides with a polynomial p and hence we 
have f {UAU*) = p{UAU*) = Up{A)U* = Uf{A)U*. Our theorems be¬ 
low show that in many cases only the unitary and antiunitary congruence 
transformations preserve the Bregman divergence. 

Theorem 3. Let f be a differentiable convex function on (0,oo) such that 
f is bounded from below and unbounded from above. Letup: P„ — P„ be 
a bijective map which satisfies 

H f {(p{A),(f){B)) = H f {A,B), A,B e P„. 

Then there exists a unitary or antiunitary operator U : C n —> C n such that 
<p is of the form 

(p{A) = UAU*, AeP n . 

Proof First we recall that since / is convex and everywhere differen¬ 
tiable, f is continuous. By the convexity of /, its derivative f is mono- 
tonically increasing. The assumption that f is bounded from below im¬ 
plies that lim A ^ 0 + f'(x) exists and is finite. Hence f can be continu¬ 
ously extended onto [0,oo). The same holds for /, too. Indeed, fix) = 
/(1) + f* f r {t)dt, x > 0 and ff f'{t)dt is convergent as x tends to 0 by the 
boundedness of f on (0,1]. 

In the rest of the proof we shall need the following characterization of 
the usual order < on P„. 

Claim A. Let B,C e P„. The set 

(6) {H f {B,A)-H f {C,A)\AeP n } 

is bounded from below if and only if B < C. 
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To prove the claim we first compute 

H f {B,A)~ H f (C, A) 

= Tr f{B) - Tr /(A) - Tr /'(A) {B - A) 

( 7 ) 

-Tr/(C)+Tr/(A)+Tr/'(A)(C-A) 

= Tr f{B) - Tr/(C) + Tr /'(A) {C-B). 

Let k denote a lower bound of /'. Assume B < C. Then by the inequality 

f'{A) > kl, AeP n , 

we have that 

Trf{A){C-B) >TrkI{C-B) = kTr{C-B) 

holds for every A e which shows that the set © is bounded from be¬ 
low. Conversely if B g C, then there exists a unit vector x e C” such that 
(Bx, x) > (Cx, x). Let P x denote the orthogonal projection onto the one¬ 
dimensional subspace generated by x. For any f>0we have 

( 8 ) 

Tr f{tP x + (7 - P x )) (C - B) = f{t) <(C - B)x, x) + f'{ 1) Tr (7 - P x ) {C-B). 

Since <(C -B)x,x) <0 and {/'(f) 11 > 0} is unbounded from above, hence 
the first term on the right hand side of © is unbounded from below. By 
0, it follows that that the set 

{Hf{B,A)-H f {C,A)\AeP n } 

is unbounded from below. This proves our claim. 

Since the bijective map <p : P n —>■ P n preserves the Bregman f- 
divergence, using the above characterization of the order we obtain that 
(f> is an order automorphism, i.e., for any B,C eP n we have B < C if and 
only if <p[B) < 0(C). 

By the result 0 Theorem 1] of the first author, 0, just as any order au¬ 
tomorphism of P n , is of the form 

0(A) = TAT*, A e P n , 

where T is an invertible linear or conjugate-linear operator on C'L We 
may suppose that T is linear, since if we are done with this, the case of 
conjugate-linear T is not difficult to handle. 

We show that T is unitary. Assume on the contrary that T is not unitary. 
Then T* T / 7. Consider the polar decomposition T = UP of T where P = 
VT*T is positive definite and U = TP~ ] is unitary. Since the Bregman 
/-divergence is invariant under unitary congruences, hence we have 

H f {A,B) = H f (0(A),0(B)) = H f [TAT*, TBT *) 

= H f ( UPAPU*,UPBPU *) = H f {PAP, PBP) 


( 9 ) 
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for all A,B e P n . Since T is not unitary, hence P / I, and thus P has an 
eigenvalue different from 1. Without serious loss of generality we may 
assume that P has an eigenvalue greater than 1 for the following rea¬ 
son. The map A PAP is a bijection of which preserves the Bregman 
/-divergence. Therefore, the inverse transformation A >-> P _1 AP _1 pre¬ 
serves the Bregman /-divergence as well. If P does not have any eigen¬ 
value greater than 1 ,P~ l must have one. 

Suppose that Pv = Av for some A > 1 and unit vector v £ C”. Let Q v 
be the orthogonal projection onto the one-dimensional subspace gen¬ 
erated by v. By J9J, the transformation A >-* PAP preserves the Bregman 
/-divergence and so does any of its powers A P n AP n . Hence 

H f (A 2 Q v , Q v ) = H f (. P n A 2 Q v P n ,P n Q v P n ) 


( 10 ) =H f {A 2[n+1) Q v ,A 2n Qv) 

holds for any neN. 

We now consider the symmetrized Bregman /-divergence Hf{A,B ) + 
Hf{B, A ) which can be written in the following convenient form 

H f {A, B ) + H f {B,A) = Tr/(A) - Tr f{B) - Tr f'{B) {A - B ) 

+ Tr/CB) - Tr/(A) -Tr f (A)(B - A) = Tr [f {A) - f (B)) {A- B). 

By nUt we have 

^/(A 2 Q„Q,) + H / (Q„A 2 Q,) 

= H f (A 2(w+1) Q v , A 2n Q v ) + H f (A 2n Q v , A 2[n+l) Q v ) 

= Tr (/'(.A 2(w+1) Q v ) - f\A 2n Q v )) (A 2(n+1) Q v - A 2n Q v ) 

= [f (A 2(n+1) ) - /' (A 2 ”)) A 2 ” (A 2 - i) 

for any ntN. This means that (/'(A 2(,2+ 0) - /' (A 2 ”)) A 2 ” is independent 
of n, that is, 

f{A 2[n+l) )-f{A 2n ) = 


(A 2 )' 


holds for some constant c. Therefore, 


lim /'( x)= lim f'(A 2[n+1] )= lim /'(l) + Y if fA 2(fc+1) ] - / fA 2fc )) 

x —*oo n — > oo n — *oo I V V ) V )) 

V A:=0 


= /'(1) + lim V- - 

n ^°° k=o (A 2 ) 


= /'(l) + c y (A -2 ) < oo. 
«=o 


which contradicts the assumption that f is unbounded from above. The 
proof of the theorem is complete. □ 
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The probably most important Bregman /-divergences on ¥ n corre¬ 
spond to the following functions: x >-* x p {p > 1), x >-► xlogx - x, x >-* 
-logx. Unfortunately, the general theorem above covers only the case 
of the first type of functions. Indeed, the second function has derivative 
which is neither bounded from below nor unbounded from above and 
the derivative of the third one is not bounded from below. Fortunately, 
the Bregman divergence related to the third function, i.e., Stein’s loss is 
of the form CO and the corresponding preservers were characterized in 
KT2l . By 112, Theorem 2] a surjective map 0 : P n -*■ P„ preserves the Stein’s 
loss if and only if there is an invertible linear or conjugate linear operator 
T : C” —» C” such that 0 is of the form 

0(A) = TAT*, AeP n . 

In what follows we characterize the preservers of Umegaki’s relative en¬ 
tropy or, in other words, von Neumann divergence which is the Bregman 
divergence corresponding to the function x >-* xlog x - x. It is clear that 
this divergence equals 

Tr(A(logA-logB) - (A-B)), A,BeV n . 

The result reads as follows. 

Theorem 4. Lettp : —*■ P n be a surjective map which satisfies 

Tr (log0(A) - log0(B)) - (0(A) - 0(B))) 

!11! =Tr(A(logA-logB)-(A-B)), A, Be P„. 

Then there exists a unitary or antiunitary operator U: C” -» C" such that 
0 is of the form 

(p{A) = UAU*, A eP n . 

Proof We begin with the following general observation. Assume that / is 
a strictly convex differentiable function on (0,oo). We assert that for any 
B,Ce P0, the set 

{H f {A,B)-H f {A,C)\AeP n } 

is bounded from below if and only if /'(B) < /'(C). Indeed, we have 

H f {A,B)-H f {A,C ) 

= Tr (/'(C) - /'(B)) A + Tr (/(C) - /(B) - (/'(C)C - /'(B)B)) 
which is easily seen to be bounded from below if and only if /'(C) - 
/'(B) >0. 

Therefore, for any surjective (and hence, by the strict convexity of /, 
bijective) map 0 : P n —► P n which preserves the Bregman /-divergence, 
we obtain that 0 has the property that 

/'(B) < /'(C) « /' (0(B)) < /' (0(C)). 
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This means that the transformation A >-> f l<p ^/ ,_1 (A)jj is an order au¬ 
tomorphism of the set of all self-adjoint operators on C” with spectrum 
contained in the range of f. 

In our particular case we have /' = log, therefore A \og[<p[e A )) is 
an order automorphism of the set of all self-adjoint operators on <C”. By 
(6] Theorem 2] we have an invertible linear or conjugate-linear operator 
T : C” -* C” and a self-adjoint linear operator X : C” —>• C” such that 

log(0(e A )) = TAT*+ X 

or 

(12) 0(A) = e T ^ AT * +x t a e P>„. 

We assume that T is linear, the conjugate-linear case is similar, not diffi¬ 
cult to handle. Consider the polar decomposition T = UP of T where U 
is unitary and P = \/T*T is positive definite. As already mentioned, the 
unitary similarity transformation A UAU* preserves Bregaman diver¬ 
gences. Since 

0(A) = e UP[l °z A)PU * +x = Ue p ^P + u*xu u% AePn> 

it follows that in Q2) we can and do assume that T is a positive definite 
operator. We prove that necessarily T = I holds. To see this, first assume 
that T has an eigenvalue which is greater than 1. We know that 

(13) 

Tr(e ra ° gA)r+x (r(logA)r- T (\ ogB ) T ) _ ^ e T{iogA)T+x _ e raogB)r+xjj 

= Tr(A(logA- log-B) - (A - B)), A, Be P„. 

Fixing A, B, write tB in the place of B where t > 0 is arbitrary. The above 
equality gives us that 

(14) a\ogt + Jj:e aogt)T2+TaogB)T+x + c = d\ogt + et + f, t> 0 

holds for some real constants a, b, c, d, e, f. Select a real number /i such 
that p/ < T(logB)r + X. Let A be an eigenvalue of T which is greater 
than 1 and let x be a corresponding unit eigenvector. Denote by P x the 
rank-one projection onto the subspace generated by x. Then A 2 P X < T 2 , 
so for t > 1 we have (log t)X 2 P x + jil < (log t) T z + T(logB) T + X. By the 
monotonicity of trace functions (see |T] 2.10. Theorem]) this implies that 

T re (1 °g f A 2 P x +/i/ < j J . g {\ogt)T 2 + T[logB)T+X t > i 

Therefore, the function t>~* Tre (logflr2+r(logB ^ r+;<: , t > 1 can be minorized 
by a function a t A + /) with some positive a and real number /3. Since A 2 > 
1, considering the equality (HD and letting t tend to infinity, we easily 
obtain a contradiction. Therefore, the eigenvalues of T are all less than 
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or equal to 1. However, the inverse of <p also preserves the von Neumann 
divergence and we have 

cp-\A) = e T ~ 1 ^ A)T ~ 1 - T ' lxT '\ AeP„ 

It follows that the eigenvalues of T~ l are also not greater than 1. We con¬ 
clude that T = I. 

We finally prove that X - 0. Let A = I and B be any element of which 
commutes with X. We obtain from (fT3l l that 

Tre x (-logJB + B-1) = Tr(-log£ + B - I). 

By the properties of the function x >-*■ -logx + x - 1, x > 0 (strictly in¬ 
creasing for x > 1, and takes the value 0 at 1) it is easy to see that any 
positive semidefinite operator D which commutes with X can be written 
as D = - logB + B-1 with some positive definite B which commutes with 
X. Consequently, we have 

Tre x D = TrD 

for any positive semidefinite operator DonC". This clearly implies that 
e x = I, i.e., X = 0. The proof of the theorem is complete. □ 

Let us mention here that the structure of all surjective maps on the 
set of nonsingular density operators (i.e., positive definite operators with 
trace 1) which preserve the relative entropy was determined in (71 Theo¬ 
rem 3]. The conclusion there is analogous to the conclusion here, those 
maps are unitary or antiunitary congruence transformations. Further¬ 
more, the fundamental idea of the proof there is basically the same as 
here although the details are rather different. 

We now turn to the preservers of Jensen divergences. Our general result 
reads as follows. 

Theorem 5. Let f be a differentiable strictly convex function on (0,oo), 
assume lim x ^o+ /(x) exists and finite and f is unbounded from above. 
Pick A e (0,1). Iff: V n — V n is a surjective map which satisfies 

Jf,X (</>(A),</>CB)) = Jf t x ( A,B ), A, B e P„, 

then there exists a unitary or antiunitary operator U : C” —>► C” such that 
f is of the form 

(p{A) = UAU*, AeP n . 

Proof Observe first that, by the assumptions on the function, / is mono- 
tonically increasing for large enough values of its variable. Next, we verify 
the following. 

Claim B. For any B,C e V n , the set 

{j ftX iA,B)-J fiX {A,C)\AeP n } 
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is bounded from below if and only if B < C. 

To prove the claim first observe that 

//,a(A B) - J f x {A, C) = (1 - A) (Tr f{B) - Tr/(C)) + 

+ Tr/(AA + (1 - A)C) -Tr f {XA + (1 - A)£), AeP n . 

Assume now that B < C and that there is a sequence (A fc ) in P„ such that 

Tr f{XA k + (1 - A)C) - Tr f{XA k + (1 - X)B) - -oo. 

Denote \ i = the eigenvalues of XA k + (1 - X)B and A[. , i = 

1,..., ft the eigenvalues of XA k + (1 - A)C both ordered in increasing or¬ 
der. By the Weyl inequality (see e.g. ® 4.3.3. Corollary]) it follows that 

lif < A[. fc) for all i and k. Moreover, we have L/ (/ (a^ 1 j -*■ 

-co which implies that the sequence fA^ j - / j j is not bounded 
from below for some i = 1 and hence it has a subsequence 

(/ fA|. fc;) j - / (m^)] ~” -oo. Since / is bounded from below, we deduce 

that -* oo. This implies that fif l) — oo and hence X^ l A - 

^ 0 for all but finitely many indexes l which is a contradiction. 
Conversely, if B C, then there exists a unit vector x e C” such that 
(Bx,x) > (Cx,x >. Set e = ({B - C)x,x) and let P x denote the orthogonal 
projection onto the one-dimensional subspace generated by x. Let m be 
a positive number such that ml - (1 - A) C is positive definite. For any 
t > 0 set 

A t :=\{ml+ tP x -{l-X)C). 

A 

JfMAt.B ) - J f ,x{A t> C) = (1 - A) (Tr f{B) - Tr/(C)) 

+ Tr/ {XA t + (1 - A)C) - Tr/(AA f + (1 - A )B) 

= (1 - A) (Tr/(B)-Tr/(C)) 

+ Tr f {ml + tP x ) -Trf {ml + tP x + (1 - X){B - C)). 

Let {x, y 2 , y 3 ,..., y„} be an orthonormal basis in C n . For 2 < j < n, denote 
by ajj the diagonal matrix elements of ml + tP x + (1 - X){B - C) relative 
to this basis, that is, 

ajj := ({ml + tP x + {l -X){B-C))yj,yj) = ({ml + (1- X){B - C)) yj,yj). 
Note that ajj is independent of t for 2 < j and 

((m/+ tP x + (1 - A)(£ - C)) x,x) = m+ t + { l-X)e. 


Now 

(15) 



13 


By Peierls inequality (see |T} 2.9. Theorem]), for the convex function / we 
have 

n 

f[m+ 7+(l-A)e) + X/(a 7 y) < Tr/(m/+7P X + (1 - A)(£ - C)). 

7=2 

On the other hand, it is apparent that 

Tr f[ml+ tP x ) = f{m+ f) + {n - 1 )f{m). 

Therefore, by CCD, 

Jf,xiA t} B) - J fi \[A t , C) < f{m + 7) - f{m + 7 + (1 - A)e) + TO, C), 
where 

TO, C) = (1 - A) (Tr/(B) - Tr/(C)) + {n- 1 )/(m) - £ /(«//) 

7=2 

is independent of the parameter 7. Since f is unbounded from above, 
hence 

lim /(m + 7) - f[m + 7 + (1 - A)e) = -oo, 

1—1-oo 

and this completes the proof of our claim. 

Using the characterization of the order given in Claim B, we see that 
the transformation 0 is an order automorphism of P„ and hence it is of 
the form 

<p{A) = TAT*, Ae P„, 

where T is an invertible linear or conjugate-linear operator on C”. We 
consider only the case where T is linear and prove that then T is neces¬ 
sarily unitary. 

As already mentioned, the unitary-antiunitary congruence transfor¬ 
mations preserve the Jensen divergences. Hence, by polar decomposi¬ 
tion, we can assume that T is a positive definite operator. 

Any power of <p is also a divergence preserver, so for every positive in¬ 
teger n we have that 

Tr A f{T n AT n ) + (1 - X)f{T n BT n ) - f{T n {AA+ (1 - A )B) T n ) 

(lb) =TrA/(A) + (l-A)/(£)-/(AA+(l-A)B), A,BeP n . 

Since / is continuously extendible onto [0,oo), thus we can insert any 
positive semidefinite operators A, B in the equality above. Assume T has 
an eigenvalue, say s, which is greater than 1 and x is a corresponding unit 
eigenvector. As before, denote by P x the orthogonal projection onto the 
subspace generated by x. Plug A - P x and B = 0 into CE) ■ We have 

Xf{s 2n )-f{Xs 2n ) = c 
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with some constant c. One can easily check that the function t >->• A/(f) - 
/(Af) is monotonically increasing (just differentiate and use that /' is in¬ 
creasing). Since s 2n — oo, it follows that 

A/(f) -/(At) = c, t> 0. 

We deduce A/'(f) = /'(At) A, t > 0 and this implies that /' is constant. We 
obtain that / is affine, a contradiction. It follows that T has no eigenvalue 
which is greater than 1. Since 0 _1 also preserves the A - / Jensen diver¬ 
gence, we have that the eigenvalues of T~ l are also not greater than 1. It 
follows that T - I and the proof of the theorem is complete. □ 

Let us again consider the three most important examples x >—• x p {p > 
1), x —>■ xlogx-x, x— log x of generating functions. The first two do sat¬ 
isfy the conditions in our theorem, hence the corresponding preservers 
are unitary-antiunitary congruence transformations. As for the third one, 
it does not satisfy the conditions (not bounded below), but the Jensen di¬ 
vergence in that case is of the form Q}, see the discussion before Propo¬ 
sitional By [12. Theorem 2], a surjective map 0 : —>■ P„ preserves the 
corresponding Jensen divergence (i.e., Chebbi-Moakher log-determinant 
a-divergence) if and only if there is an invertible linear or conjugate lin¬ 
ear operator T : C" —>■ C” such that 0 is of the form 

0(A) = TAT*, AeP n . 

In closing the paper we finally remark that the main result in our pa¬ 
per (TO) where we have described the structure of all transformations on 
any dense subset of density operators that preserve the Holevo bound is 
closely related to Theorem[5]in the particular case where the function / is 
x | —*■ xlogx-x, x > 0. Indeed, the Holevo bound of the ensemble {A, 1 - A} 
is just the corresponding Jensen divergence. We admit that the proof in 
ITOl is of completely different character. 

References 

1. E. Carlen, Trace inequalities and quantum entropy: an introductory course, Con- 
temp. Math. 529 (2010), 73-140. 

2. Z. Chebbi and M. Moakher, Means ofhermitian positive-definite matrices based on 
the log-determinant a-divergence function, Linear Algebra Appl. 436 (2012), 1872- 
1889. 

3. O. Hatori and L. Molnar, Isometries of the unitary groups and Thompson isometries 
of the spaces of invertible positive elements in C* -algebras, J. Math. Anal. Appl. 409 
(2014), 158-167. 

4. R.A. Horn and C.R. Johnson, Matrix Analysis, Cambridge University Press, 1990. 

5. A. Jarai, Regularity Properties of Functional Equations in Several Variables, Advances 
in Mathematics (Springer), Springer, New York, 2005. 


15 


6. L. Molnar, Order-automorphisms of the set of bounded observables, J. Math. Phys. 42 
(2001), 5904-5909. 

7. L. Molnar, Order automorphisms on positive definite operators and a few applica¬ 
tions, Linear Algebra Appl. 434 (2011), 2158-2169. 

8. L. Molnar, Jordan triple endomorphisms and isometries of spaces of positive definite 
matrices, Linear Multilinear Alg. 63 (2015), 12-33. 

9. L. Molnar, General Mazur-Ulam type theorems and some applications, in Operator 
Semigroups Meet Complex Analysis, Harmonic Analysis and Mathematical Physics, 
W. Arendt, R. Chill, Y. Tomilov (Eds.), Operator Theory: Advances and Applications, 
Vol. 250, to appear. 

10. L. Molnar and G. Nagy, Transformations on density operators that leave the Holevo 
bound invariant, Int. J. Theor. Phys. 53 (2014), 3273-3278. 

11. L. Molnar, G. Nagy and P. Szokol, Maps on density operators preserving quantum 
f -divergences, Quantum Inf. Process. 12 (2013), 2309-2323. 

12. L. Molnar and P. Szokol, Transformations on positive definite matrices preserving 
generalized distance measures, Linear Algebra Appl. 466 (2015), 141-159. 

13. E Nielsen and R. Bhatia (Eds), Matrix Information Geometry, Springer, Heidelberg, 
2013. 

14. J. Pitrik and D. Virosztek, On the joint convexity of the Bregman divergence of matri¬ 
ces, Lett. Math. Phys. 105 (2015), 675-692. 

15. R.T. Rockafellar, Convex Analysis, Princeton University Press, Princeton, 1970. 

16. S. Sra, Positive definite matrices and the S-divergence, preprint, arXiv math.FA- 
1110.1773v4. Shorter version of the manuscript is submitted to Proc. Amer. Math. 
Soc. 

Department of Analysis, Bolyai Institute, University of Szeged, H-6720 
Szeged, Aradi vertanuk tere 1., Hungary and MTA-DE “Lendulet” Functional 
Analysis Research Group, Institute of Mathematics, University of Debrecen, 
H-4010 Debrecen, P.O. Box 12, Hungary 
E-mail address: molnarlOmath. u-szeged. hu 
URL: http://www.math.unideb.hu/~molnarl/ 

Institute of Mathematics, Budapest University of Technology and Eco¬ 
nomics, H-1521 Budapest, Hungary 
E-mail address: pitrikOmath. bine . hu 
URL: http://www.math.bme.hu/“pitrik 

Institute of Mathematics, Budapest University of Technology and Eco¬ 
nomics, H-1521 Budapest, Hungary 
E-mail address: viroszOmath. bme. hu 
URL: http://www.math.bme.hu/~virosz 



